Affine-invariant diffusion geometry for the analysis of deformable 3D shapes 
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Abstract 



We introduce an ( equi- )affine invariant diffusion geometry by which surfaces that go through squeeze and shear trans- 
formations can still be properly analyzed. The definition of an affine invariant metric enables us to construct an invariant 
Laplacian from which local and global geometric structures are extracted. Applications of the proposed framework demon- 
strate its power in generalizing and enriching the existing set of tools for shape analysis. 

1. Introduction 

Diffusion geometry is an umbrella term referring to geometric analysis of diffusion or random walk processes. Such meth- 
ods, first introduced in theoretical geometry Q have matured into practical applications in the fields of manifold learning fT] 
and shape analysis ifTTl . In the shape analysis community, diffusion geometry methods were used to define low-dimensional 
representations for manifolds 1171 1211 . build intrinsic distance metrics and construct shape distribution descriptors ll2Tl[T2ll . 
define spectral signatures 1201 (shape-DNA), and local descriptors 12311 . Diffusion embeddings were used for finding cor- 
respondence between shapes lT3l and detecting intrinsic symmetries ITSl . In many settings, the construction of diffusion 
geometry boils down to the definition of a Laplacian operator. Such an operator should possess certain invariance properties 
desired in a specific application. 

In this paper, we construct ( equi- )affine- invariant diffusion geometry for 3D shapes. Affine invariance is important in 
many applications in the analysis of images ITSl and 3D shapes We first construct an affine-invariant Riemannian 
metric that allows us define an affine-invariant Laplacian, with which we in turn define affine invariant diffusion geometry 
for surfaces. This new geometry enables efficient computational tools that handle both non-rigid approximately-isometric 
deformations of the surface together with equi-affine transformations of the embedding space. We demonstrate the usefulness 
of our construction in a range of shape analysis applications, such as retrieval, correspondence, and symmetry detection. 

2. Background 

Let X denote a compact two-dimensional Riemannian manifold (possibly with boundary) representing the outer boundary 
of a physical solid object in the 3D space. The Riemannian metric tensor g is defined as a local inner product (•, ■)x on the 



1 



tangent plane T^X at each point x £ X. Given a smooth scalar field / : X — > R, its gradient V/ at point x is defined 
through the relation f{x + dr) = f{x) + (V/(a;), dr)^, where dr £ T*X is an infinitesimal tangent vector The positive 
semi-definite self-adjoint Laplace-Beltrami operator Ag associated with the metric tensor g is defined by the identity 

fAghda = -J{Vf,Vh),da (1) 

holding for any pair of smooth scalar fields f,h : X ^ R; here da denotes the standard area measure on X. Whenever 
possible, we will omit the subscript g and refer to the Laplace-Beltrami operator simply as to A. 

Assuming further that the manifold is embedded isometrically in R^* and (possibly, locally) parametrized by a regular map 
X : J7 — > M'^ over a planar domain U, the metric tensor g assumes the form of a 2 x 2 positive-definite matrix called the first 
fundamental form, whose elements are given by the inner products ~ ^t"§u'' 

The Laplace-Beltrami operator can be expressed in the parametrization coordinates as 

A - -J^d.^^g^ld„ (2) 

where we use Einstein's summation convention, according to which g = det{gij) denotes the determinant of the metric, 
(7*^ are the components of the inverse metric tensor, and repeated indices are summed over When the metric is Euclidean 

{gij = I), the operator reduces to the familiar A/ = — ( + w^l (note that we define the Laplacian with the minus sign 



in order to ensure its positive semi-definiteness). 

The Laplace-Beltrami operator gives rise to the partial differential equation 

^ + A^/(t,a:)=0, (3) 

called the heat equation. The heat equation describes the propagation of heat on the surface and its solution /(t, x) is the 
heat distribution at a point x in time t. The initial condition of the equation is some initial heat distribution /(O, x)\ if X has 
a boundary, appropriate boundary conditions must be added. The solution of (O corresponding to a point initial condition 
/(O, a;) = 5{x — x'), is called the heat kernel and represents the amount of heat transferred from x to x' in time t due to the 
diffusion process. Using spectral decomposition, the heat kernel can be represented as 



ht{x,x') = ^e-^'V.(a;)0.(a^') (4) 

i>0 



where 0^ and Ai are, respectively, the eigenfunctions and eigenvalues of the Laplace-Beltrami operator satisfying A(f)i = Xi(pi 
(without loss of generality, we assume to be sorted in increasing order starting with Aq = 0). Since the Laplace-Beltrami 
operator is an intrinsic geometric quantity, i.e., it can be expressed solely in terms of the metric of X, its eigenfunctions and 
eigenvalues as well as the heat kernel are invariant under isometric transformations of the manifold. 

The value of the heat kernel hf {x, x') can be interpreted as the transition probability density of a random walk of length t 
from the point x to the point x'. This allows to construct a family of intrinsic metrics known as diffusion metrics, 

dUx,x')= /"(/it(a;,-)-/it(a;',-))'da-^e-^'*(0,(x)-</),(a;'))', (5) 

•' i>0 

which measure the "connectivity rate" of the two points by paths of length t. 

The parameter t can be given the meaning of scale, and the family {dt} can be thought of as a scale-space of metrics. By 
integrating over all scales, a scale-invariant version of ^ is obtained, 

dlT,{x,x') = 2/ dUx,x')dt = y2-{Ux)-Mx'))^- (6) 

This metric is referred to as the commute-time distance and can be interpreted as the connectivity rate by paths of any length. 
We will broadly call constructions related to the heat kernel, diffusion and commute time metrics as diffusion geometry. 
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3. Affine-invariant diffusion geometry 



An ajfine transformation x ^ Ax + b of the three-dimensional Euchdean space can be parametrized by a regular 3x3 
matrix A and a 3 x 1 vector b since all constructions discussed here are trivially translation invariant, we will omit the vector 
b. The transformation is called special ajfine or equi-ajfine if it is volume-preserving, i.e., det A = 1. 

As the standard Euclidean metric is not affine-invariant, the Laplace-Beltrami operators associated with X and A.X are 
generally distinct, and so are the resulting diffusion geometries. In what follows, we are going to substitute the Euclidean 
metric by its equi-affine invariant counterpart. That, in turn, will induce an equi-affine-invariant Laplace-Beltrami operator 
and define equi-affine-invariant diffusion geometry. 

The equi-affine metric can be defined through the parametrization of a curve HI |4] |5] 13 [l6l |22l . Let C be a curve on X 
parametrized by p. By the chain rule, 



dC 
dp 


du\ ^ 

^ dp 


X 

dp 


d^C 
dp'' 


d'^ui 
^ dp' 


+ x "^^"^ 
dp' 



(dui\ dui du2 (du2\ 

^11 +2X12— 1-+X22 — , (7) 

\ dp J dp dp \ dp J 

where, for brevity, we denote x^ = and x^ — g ".g^ . ■ As volumes are preserved under the equi-affine group of 
transformations, we define the invariant arclength p through 



det(xi,X2, Cpp) = 1. (8) 



Plugging (|7]i into dH) yields 



dp' = dct(xi, X2, xiidit^ + 2xi2duidu2 +X22dul), (9) 
from where we readily have an equi-affine-invariant pre-metric tensor 

9ij = 9tj\9r^^^, (10) 

where Qij = det(xi, X2, x^ ). The pre-metric tensor ( fTOl i defines a true metric only on strictly convex surfaces 15); in 
more general cases, it might cease from being positive definite. In order to deal with arbitrary surfaces, we extend the 
metric definition by restricting the eigenvalues of the tensor to be positive. Representing g as a 2 x 2 matrix admitting the 
eigendecomposition G = UFU^, where U is orthonormal and T = diag{7i,72}, we compose a new first fundamental 
form matrix G = U|r|U^. The corresponding metric tensor g is positive definite and is equi-affine invariant. 

Plugging this g into ([T]i, we obtain an equi-affine-invariant Laplace-Beltrami operator Ag. Such an operator defines 
an equi-affine-invariant diffusion geometry, i.e., the eigenfunction, heat kernel, and diffusion metrics it generates remain 
unaltered by a global volume-preserving affine transformation of the shape (Figures |2H3]l. 



4. Discretization 

In order to compute the equi-affine metric we need to evaluate the second-order derivatives of the surface with respect to 
some parametrization coordinates. While this can be done practically in any representation, here we assume that the surface 
is given as a triangular mesh. For each triangular face, the metric tensor elements are calculated from a quadratic surface 
patch fitted to the triangle itself and its three adjacent neighbor triangles. The four triangles are unfolded to the plane, to 
which an affine transformation is applied in such a way that the central triangle becomes a unit simplex. The coordinates of 
this planar representation are used as the parametrization u with respect to which the first fundamental form coefficients are 
computed at the barycenter of the simplex (Figure[T]i. This step is performed for every triangle of the mesh. 

Having the discretized first fundamental form coefficients, our next goal is to discretize the Laplace-Beltrami operator 
Since our final goal is not the operator itself but its eigendecomposition, we skip the construction of the Laplacian and 
discretize its eigenvalues and eigenfunctions directly. This is achieved using the finite elements method (FEM) proposed 
in ISl and used in shape analysis in ||T9|. For that purpose, we translate the eigendecomposition of the Laplace-Beltrami 
operator = Xcp into a weak form 

J ipkAcpda = X J ipk4'da (11) 
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Figure 1 ; Left to right: part of a triangulated surface about a specific triangle. The three neighboring triangles together with the central 
one are unfolded flat to the plane. The central triangle is canonized into a right isosceles triangle; three neighboring triangles follow the 
same planar affine transformation. Finally, the six surface coordinate values at the vertices are used to interpolate a quadratic surface patch 
from which the metric tensor is computed. 



with respect to some basis {ipk} spanning a (sufficiently smooth) subspace of L'^{X). Specifically, we choose the ipk's. to be 
the first-order finite element function obtaining the value of one at a vertex k and decaying linearly to zero in its 1-ring (the 
size of the basis equals to the number of vertices in the mesh). Substituting these functions into ( fTTT) . we obtain 



ibk^^da = j {\/^k,^4>)xda= J g'^ {d,4>){dj^k) da = Xj ^k^da. (12) 

Next, we approximate the eigenfunction in the finite element basis by (/) = oii'^i- This yields 

j g'''{di^ai'>pi){dji})u)da = X i'k^aiipida, 
I I 

or, equivalently, 

/ g'-^d.i'ilJi){dj'ipk) da = >^'^ai j '4'kipida. 
I I 

The last equation can be rewritten in matrix form as a generalized eigendecomposition problem Aa = ABa solved for the 
coefficients ai, where 



aki = J g'^ {d^Tjji){djipk) da, 
bki = Ji/Jkipida, 

and the local surface area is expressed in parametrization coordinates as da = ^duidu2- 

5. Applications and results 

The proposed equi-affine-invariant Laplacian is a practical tool that can be employed in the construction of local and 
global diffusion geometric structures used in standard approaches in shape analysis, substituting the traditional non-invariant 
Laplace-Beltrami operator. In what follows, we detail the construction of such structures and show applications in shape 
retrieval, correspondence, matching, and symmetry detection. 
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Figure 2: Four eigenfunctions of the standard (second through fifth columns) and the proposed equi-affine-invariant (four rightmost 
columns) Laplace-Beltrami operators. Two rows show a shape and its equi-affine transformation. For convenience of visualization, 
eigenfunctions are overlaid onto the untransformed shape. 

5.1. Shape retrieval 

Sun et al. 1231 proposed using the diagonal of the heat kernel, ht{x, x), as a local descriptor of the shape, referred to as 
the heat kernel signature (HKS). In practice, the descriptor is computed by sampling the time parameter t at a discrete set of 
points, ti, . . . ,tn, and collecting the corresponding values of ht{x, x) into a vector p{x) = {ht^ {x, x), . . . , ht^ [x, x)). The 
HKS descriptor is isometry-invariant, easy to compute, and is provably informative 12311 . The affine-invariant HKS descriptors 
can be used in one-to-one feature-based shape matching methods l23l . or in large-scale shape retrieval applications using 
the bags of features paradigm fT7l|. In the latter, the shape is considered as a collection of "geometric words" from a fixed 
"vocabulary" and is described by the statistical distribution of such words. The vocabulary is constructed off-line by clustering 
the descriptor space. Then, for each point on the shape, the descriptor is replaced by the nearest vocabulary word by means 
of vector quantization. Counting the frequency of each word, a bag of features is constructed. The similarity of two shapes 
is then computed as the distance between the corresponding bags of features. 

To evaluate the performance of the proposed approach for the construction of local descriptors, we used the Shape 
Google framework ITTll based on standard and equi-affine-invariant HKS. Both descriptors were computed at six scales 
{t = 1024, 1351.2, 1782.9, 2352.5, and 4096). Bags of features were computed using soft vector quantization with vari- 
ance taken as twice the median of all distances between cluster centers in a vocabulary of 64 entries. Approximate nearest 
neighbor method [2| was used for vector quantization. Both the standard and the affine-invariant Laplace-Beltrami operator 
discretization were computed using finite elements. Heat kernels were approximated using the first smallest 100 eigenpairs. 

Evaluation was performed using the SHREC 2010 robust large-scale shape retrieval benchmark methodology. The dataset 
consisted of two parts: 793 shapes from 13 shape classes with simulated transformation of different types (Figure [H and 
strengths (60 per shape) used as queries, and the remaining 521 shapes used as the queried corpus. Transformations classes 
afftne and isometry+affine were added to the original SHREC query set, representing, respectively, equi-affine transforma- 
tions of different strengths of the null shape and its approximate isometry. The combined dataset consisted of 1314 shapes. 
Retrieval was performed by matching 780 transformed queries to the 534 null shapes. Each query had one correct correspond- 
ing null shape in the dataset. Performance was evaluated using precision/recall characteristic. Precision P{r) is defined as 
the percentage of relevant shapes in the first r top-ranked retrieved shapes. Mean average precision (mAP), defined as 
mAP = J2r ^('') ' fd{r), where rel{r) is the relevance of a given rank, was used as a single measure of performance. Intu- 
itively, mAP is interpreted as the area below the precision-recall curve. Ideal retrieval performance (mAP=100%) is achieved 
when all queries return relevant first matches. Performance results were broken down according to transformation class and 
strength. 

Tables [T]-|2] show that the equi-affine version of the ShapeGoogle approach obtains slightly higher precision than the 
original ShapeGoogle in all SHREC 10 transformations. We attribute this phenomenon to the smoothing effect of the second 
order interpolation. In addition, equi-affine ShapeGoogle exhibits nearly perfect retrieval under equi-affine transformations 
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Figure 3: Heat kernel signature ht{x,x) and diffusion metric ball (second and third columns, respectively), and their affine-invariant 
counterparts (fourth and fifth columns, respectively). Two rows show a shape and its equi-affine transformation. For convenience of 
visualization, the kernel and the metric are overlaid onto the untransformed shape. Plots under the figure show the corresponding metric 
distributions before and after the transformation. 




Figure 4: Examples of query shape transformations used in the shape retrieval experiment (left to right): null, isometry, topology, affine, 
affine+isometry, sampling, local scale, holes, microholes, Gaussian noise, shot noise. 

in affine transformations, wliere the original approach fails. 
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strength 



Transform. 


1 


<2 


<3 


<4 


<5 


Isometry 


100.00 


100.00 


100.00 


100.00 


99.23 


Affine 


100.00 


100.00 


100.00 


100.00 


97.44 


Iso. + Affine 


100.00 


100.00 


100.00 


100.00 


100.00 


Topology 


96.15 


94.23 


91.88 


89.74 


86.79 


Holes 


100.00 


100.00 


100.00 


100.00 


100.00 


Micro holes 


100.00 


100.00 


100.00 


100.00 


100.00 


Local scale 


100.00 


100.00 


94.74 


82.39 


73.97 


Sampling 


100.00 


100.00 


100.00 


96.79 


86.10 


Noise 


100.00 


100.00 


89.83 


78.53 


69.22 


Shot noise 


100.00 


100.00 


100.00 


97.76 


89.63 



Table 1 ; Performance (mAP in %) of Shape Google with equi-affine-invariant HKS descriptors. 



Strength 



Transform. 


1 


<2 


<3 


<4 


<5 


Isometry 


100.00 


100.00 


100.00 


100.00 


100.00 


Affine 


100.00 


86.89 


73.50 


57.66 


46.64 


Iso. +Affine 


94.23 


86.35 


76.84 


70.76 


65.36 


Topology 


100.00 


100.00 


98.72 


98.08 


97.69 


Holes 


100.00 


96.15 


92.82 


88.51 


82.74 


Micro holes 


100.00 


100.00 


100.00 


100.00 


100.00 


Local scale 


100.00 


100.00 


97.44 


87.88 


78.78 


Sampling 


100.00 


100.00 


100.00 


96.25 


91.43 


Noise 


100.00 


100.00 


100.00 


99.04 


99.23 


Shot noise 


100.00 


100.00 


100.00 


98.46 


98.77 



Table 2; Performance (mAP in %) of Shape Google with HKS descriptors. 



5.2. Global structures 

The equi-affine-invariant Laplacian can also be employed in the construction of global geometric structures. By plugging 
it into (|5]l, a family of equi-affine-invariant diffusion distances is obtained. Similarly, a truly affine-invariant version of the 
commute time metric is obtained by using the equi-affine-invariant operator in Q. These metric structures can be used in 
the Gromov-Hausdorjf framework ifTolfT?! . in which shapes are modeled as metric spaces, and the similarity of two shapes 
(X, dx) and (F, dy) is established by looking at the minimum-distortion correspondence between them. A correspondence 
is defined as a subset C <Z X x Y such that for all x ^ X there exists y ^ Y such that (x, y) G C, and vice versa, for all 
y G Y there exists x £ X such that {x,y) E C. The distortion of C is defined as 

dis(C) = max \dx{x, x') - dY{y,y')\- (13) 

{x,y}.{y,y')eC 

The Groniov-Hausdorff distance is given as the minimum of the distortion over all possible correspondences, 

dGYi{X,Y) = imnidis(C), (14) 

and serves as a criterion for shape similarity in the sense that two shapes with dG'ii{X, Y) < e are at most 2e-isometric, and, 
vice versa, two e-isometric shapes have at most dQii{X, Y) < 2e. A byproduct of this problem is the minimum-distortion 
correspondence C. 

Figure|5]shows the correspondences obtained between an equi-affine transformation of a shape using the standard and the 
equi-affine-invariant versions of the diffusion metric. Minimization of a least-squares version of ( fT4l i was performed using 
the generalized multidimensional scaling (GMDS) algorithm. In the case of the standard diffusion metric, the embedding 
distortion grew by over 6 times as the result of the transformation, while in the case of the proposed invariant diffusion metric, 
the increase was by mere 16%. 
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Figure 5: The GMDS framework is used to calculate correspondence between a shape and its isometry (left) and isometry followed by an 
equi-affine transformation (right). Matches between shapes are depicted as identically colored Voronoi cells. Standard diffusion distance 
(first row) and its equi-affine-invariant counterpart (second row) are used as the metric structure in the GMDS algorithm. Inaccuracies 
obtained in the first case are especially visible in the abdominal region. 
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5.3. Intrinsic symmetry detection 

Ovsjanikov et al. IfTSi proposed detecting intrinsic symmetries of a shape by analyzing the eigenfunctions of the Laplace- 
Beltrami operator. Intrinsic symmetry is manifested in the existence of a self-isometry f : X ^ X, under which the metric 
structure of the shape is preserved, i.e., d = d o (/ x /), where d is the commute time metric. Ovsjanikov et al. ifTSll observe 
that for any intrinsic reflection symmetry /, simple eigenfunctions of A satisfy <f>i o f ~ ±(t>i- Thus, the symmetries of X 
can be parameterized by the sign signature (si, S2, . . .); Si S {— 1, 1} such that (pi o f = Sipii. 

Given a truncated sign signature (si, . . . , sk), define an energy 



It is easy to show that E = Q for sign signatures corresponding to intrinsic symmetries, and E' w for approximate 
symmetries satisfying d k, do [f x f). The symmetry itself is recovered as 



Employing our equi-affine-invariantLaplacian, the detection of intrinsic symmetries can be made under affine transformations 
of the shape. 

Figure |6] shows an example of intrinsic symmetry detection with the method of ifTSl using standard (first row) and the 
proposed affine-invariant (second row) Laplace-Beltrami operator on an intrinsically symmetric centaur shape that underwent 
a mild isometric and affine transformation. In this experiment, we use the five first non-trivial eigenfunctions and show the 
approximate symmetries corresponding to the sign signatures producing the smallest values of E (sign signature + + + + + 
corresponding to the identity transformation was ignored). While no meaningful symmetries are detected in the first case, 
using our affine-invariant Laplace-Beltrami operator we were able to detect three approximate symmetries of the shape: hands 
and forward legs reflection (left), rear legs reflection (center), and full body reflection (right). 

6. Conclusions 

We introduced an equi-affine-invariant Laplace-Beltrami operator on two-dimensional surfaces, and showed that it can be 
utilized to construct affine-invariant local and global diffusion geometric structures. Performance of the proposed tools was 
demonstrated on shape retrieval, correspondence, and symmetry detection applications. Our results show that affine-invariant 
diffusion geometries gracefully compete with, and sometimes even outperform, their classical counterparts under isometric 
changes and in the presence of geometric and topological noise, while significantly outperforming the latter under affine 
transformations. 

Extension of the proposed equi-affine framework into fully affine invariance (including scale) could be accomplished by 
either exploiting the scale invariance property of the commute time distance, or the consideration of scale invariant signatures, 
two approaches we plan to study in the future. 
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